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Abstract 

Zero-range processes with decreasing jump rates exhibit a condensation transition, where a positive 
fraction of all particles condenses on a single lattice site when the total density exceeds a critical 
value. We study the onset of condensation, i.e. the behaviour of the maximum occupation number 
after adding or subtracting a subextensive excess mass of particles at the critical density. We establish 
a law of large numbers for the excess mass fraction in the maximum, which turns out to jump 
from zero to a positive value at a critical scale. Our results also include distributional limits for the 
fluctuations of the maximum, which change from standard extreme value statistics to Gaussian when 
the density crosses the critical point. Fluctuations in the bulk are also covered, showing that the mass 
outside the maximum is distributed homogeneously. In summary, we identify the detailed behaviour 
at the critical scale including sub-leading terms, which provides a full understanding of the crossover 
from sub- to supercritical behaviour. 
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1 Introduction 

The zero-range process is one of the interacting particle systems introduced in the seminal paper [26]. 
The process has unbounded local state space, i.e. there is no restriction on the number of particles 
per site, and the jump rate g(n) at a given site depends only on the number of particles n at that site. 
This simple zero-range interaction leads to a product structure of the stationary measures [1, 26] and 
further interest was initially on the existence of the dynamics under general conditions [1] and on 
establishing hydrodynamic limits. These questions have been successfully addressed in the case of 
attractive zero-range processes when g(n) is a non-decreasing function, and results are summarized in 
[23]. For such processes with additional space dependence of the rates g x , there is also a number of 
rigorous results regarding condensation on slow sites [2, 18, 24]. 

More recently, there has been increasing interest in zero-range processes with spatially homogeneous 
jump rates g(n) decreasing with the number of particles. This results in an effective attraction of the 
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particles and can lead to condensation phenomena. A generic family of models with that property has 
been introduced in the theoretical physics literature [14], with asymptotic behaviour of the jump rates 



n) ~ 1 H r- as n — > oo . 



(1.1) 



For A G (0, 1), b > and for A = 1, b > 2 the following phase transition was established using 
heuristic arguments: If the particle density p exceeds a critical value p c , the system phase separates into 
a homogeneous background with density p c and a condensate, a single randomly located lattice site 
that contains a macroscopic fraction of all the particles. This type of condensation appears in diverse 
contexts such as traffic jamming, gelation in networks, or wealth condensation in macro-economies, 
and zero range processes or simple variants have been used as prototype models to explain these 
phenomena (see [15] for a review). 

The existence of invariant measures with simple product structure makes the problem mathematically 
tractable. Jeon, March and Pittel showed in [22] that for some cases of zero-range processes the 
maximum site contains a non-zero fraction of all the particles. Condensation has been established 
rigorously in [21] by proving the equivalence of ensembles in the thermodynamic limit, where the 
lattice size L and the number of particles N tend to infinity such that N/L — > p. This implies 
convergence of finite-dimensional marginals of stationary measures conditioned on a total particle 
number N, to stationary product measures with density p in the subcritical case p < p c , and with 
density p c in the supercritical case p > p c . In the latter case the condition on the particle number is an 
atypical event which is most likely realized by a large deviation of the maximum component, and the 
problem can be described as Gibbs conditioning for measures without exponential moments. It turns 
out (cf. [3]) that a strong form of the equivalence holds in the supercritical case, which determines 
the asymptotic distribution of the particles on all L sites. A similar result has been established in [17] 
on a lattice of fixed size L in the limit N — > oo, and the local equivalence of ensembles result was 
generalized to processes with several particle species in [20]. More recent rigorous results address 
metastability for the motion of the condensate [5, 6]. 

In this paper we study the properties of the condensation transition at the critical density p c for the 
processes introduced in [14] with rates (1.1), to understand the onset of the condensate formation. We 
consider the thermodynamic limit with N/L — > p c , with the excess mass N — p c L is on a scale o(L). 
Our results are formulated in Section 2.2 and provide a rather complete picture of the transition from 
a homogeneous subcritical to condensed supercritical behaviour. It turns out that the condensate forms 
suddenly on a critical scale N — p c L <~ Ax,, which is identified in Theorems 2.1 and 2.3 to be 



Our results imply a weak law of large numbers for the ratio Ml / (N — p c L) where Ml is the maximum 
occupation number, which is illustrated in Figure 1. The ratio exhibits a sudden jump from to a 
positive value when the excess mass reaches the critical size Ax. At this point both values can occur 
with positive probability depending on sub-leading orders of the excess mass, which is discussed in 
detail in Section 2.3. For A = 1 the full excess mass is concentrated in the maximum right above the 
critical scale. On the other hand, for A e (0,1) the excess mass is shared between the condensate and 
the bulk, and the condensate fraction increases from 2A/(1 + A) to 1 only as (N — p c L)/Ai — > oo. 
Theorem 2.5 provides results for the bulk fluctuations, which imply that the mass outside the maximum 
is always distributed homogeneously and the system typically contains at most one condensate site. 
Theorems 2.1 and 2.3 also cover the fluctuations of the maximum, which change from standard extreme 
value statistics to Gaussian. This is complemented by Theorems 2.2 and 2.4 on downside deviations, 




y/(b-3)LlogL for A = 1, b > 3 and 
x (a 2 L)rh: for Ae (0,1), b>0. 



(1.2) 
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Figure 1: Illustration of the law of large numbers for the excess mass fraction L in the condensate on the 
critical scale A_l (1.2). For A = 1 the results are given in Theorem 2.1 ((2.21) and (2.23)) and for A £ (0, 1) 
in Theorem 2.3 ((2.32) and (2.34)). The behaviour at 1 depends on the sub-leading terms in the excess mass, as 
detailed in (2.19) and (2.36). 



which give a detailed description of the crossover to the expected Gumbel distributions in the subcritical 
regime (p < p c ), where the marginals have exponential tails. In [16] the fluctuations of the maximum 
for A = 1 were observed by the use of saddle point computations to change from Gumbel (p < p c ), 
via Frechet (p — p c ), to Gaussian or stable law fluctuations (p > p c ), raising the question on how 
the transition between these different regimes occurs. Our results around the critical point provide a 
detailed, rigorous answer to that question, covering also the case A G (0, 1). We use previous results 
on local limit theorems for moderate deviations of random variables with power-law distribution [13] 
for the case A = 1, and stretched exponential distribution [25] for A G (0,1). In the latter case we can 
also extend the results for p > p c (Corollary 2.6) to parameter values that were not covered by previous 
results [3]. 

In general, the onset of phase separation and phase coexistence at the critical scale is a classical question 
of mathematical statistical mechanics. This has been studied for example in the Ising model and related 
liquid/vapour systems in [8, 9], where a major point is the shape of critical 'droplets' . Here we treat this 
question in the case of zero-range condensation, where the main mathematical challenges are related to 
subexponential scales and a lack of symmetry between the fluid and condensed phase. The condensate 
turns out to always concentrate on a single lattice site (even at criticality), and contains a positive frac- 
tion of the excess mass. In contrast to liquid/vapour systems, this fraction is not 'universal', but depends 
on the system parameter A (see also discussion in Section 2.3). From a mathematical point of view, the 
analysis includes interesting connections to extreme value statistics and large deviations for subexpo- 
nential random variables, which in itself is an area of recent research interest (see [12, 4] and references 
therein). Our results also provide a detailed understanding of finite-size effects and metastability close 
to the critical point, which are important in applications such as traffic flow and granular clustering (see 
[10] and references therein). 
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2 Definitions and results 



2.1 The zero-range process and condensation 

We consider the zero-range processes on a finite set of size L. Given a jump rate function g : No = 
{0, 1,2,...} i y [0, +oo) such that g(n) = <^=> n = and a set of transition probabilities p(-, •) on 
Ai x Al, a zero range process is defined as a Markov process on the state space X L = No L of all 
particle configurations 

rj = (t] x : x e A L ) , (2.1) 
where rj x <E No is the local occupation number at site x. The dynamics is given by the generator 

cm= E 9(v x )p(x,y){f(v x ' y )-m) (2.2) 

{r] x -l, z = x and r\ x > 
rjy + 1 , z = y and rfc > 
rj z , otherwise. 

For a technical discussion of the domain of test functions / of the generator and the corresponding 
construction of the semigroup we refer to [1]. The practical meaning of (2.2) is that any given site 
x looses one particle with rate g{rj x ) and this particle then jumps to site y with probability p(x,y). 
To avoid degeneracies p(x, y) should be irreducible transition probabilities of a random walk on A^,. 
This way, the number of particles is the only conserved quantity of the process, leading to a family of 
stationary measures indexed by the particle density. In the following we are interested in the situation 
where these measures are spatially homogeneous. This is guaranteed by the condition that the harmonic 
equations 

x£A L 

have the constant solution \ x = 1, and by the irreducibility of p(x, y) this implies that every solution is 
constant. This is for example the case if Al is a regular periodic lattice and p(x, y) is translation invari- 
ant, such as Al = Z/LZ andp(x,y) = S y . x+ i for totally asymmetric or p(x, y) = \8 y , x +i + \5 y , x -i 
for symmetric nearest-neighbour hopping. 

It is well known (see e.g. [1, 26]) that under the above conditions the zero-range process has a family of 
stationary homogeneous product measures v^. The occupation numbers ri x are i.i.d. random variables 
with marginal distribution 

1 ™ 1 

v$ [Vx =n] = — — w(n) <p n where w(n) = TT — — . (2.4) 

z{4>) k=i 9 ^ 

The parameter <p of the stationary measures is called the fugacity, and the measures exist for all <j> > 
such that the normalization (partition function) is finite, i.e. 

00 

z{(j,) : = w(n)(f> n < 00 . (2.5) 

n=0 

The particle density as a function of can be computed as 

R{4>) := W* [ Vx ] = 4> 9 logz(0), (2.6) 
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and turns out to be strictly increasing and continuous with R(0) = 0. 



In this paper we consider the family of models introduced in [14], where the jump rates have asymptotic 
behaviour 

, x b 

gin) ~ H r- asn^oo, (2.7) 

n A 

with b > and A e (0, 1]. In (2.7) and hereafter we use the notation a n ~ b n as n — > oo, if 
lim n ^oo a n /b n — 1. We will also write a n <~ 6„ as n — > oo if there is a constant C > 1 such that 
C _1 < a n /b n < C for sufficiently large n. With (2.4) this definition of jump rates leads to stationary 
weights with asymptotic power law decay 

w{n) ~ A in - b for A = 1 , (2.8) 

and stretched exponential decay 

w(n) ~ A x cxp ( - -J^n 1 -^ for A e (0,1), (2.9) 

with constant prefactors Aa- 

In the second case the distributions (2.4) are well defined for all <f) € [0, 1] with finite maximal 
(critical) density 

p c := R(l) < oo (2.10) 
and finite corresponding variance 

a 2 : = E Ul [rj 2 .] — pi < oo . (2.11) 

If A = 1 the corresponding variance is finite if b > 3, which we will assume hereafter. The case 
2 < b < 3 is not covered by our main results, and we discuss it shortly in Section 2.3. In general, 
(2.9) also contains terms of lower order n 1 ^ kx , k > 2, in the exponent, which may contribute to the 
asymptotic behaviour for A e (0, 1/2] depending on the subleading terms in the jump rates (2.7). To 
avoid these complications when A < 1/2, we focus on processes with rates (2.7) for which (2.9) holds. 
The simplest way to meet this condition is to choose g(n) = w(n — l)/w(n), n > 1, with w(n) as in 
the right hand of (2.9) with A x = 1. 

It has been shown in [14, 21] that when the critical density is finite the system exhibits a condensation 
transition that can be quantified as follows. Since the number of particles is conserved by the micro- 
scopic dynamics for each TV e N, the subspaces 

X l .n = {v G X l : S L {r]) = N} where S L (ri) = J2 ** (2A2 ^ 

xeA L 

are invariant. The zero range process is irreducible over each of these subspaces and the unique invariant 
measure supported on Xl,n is given by 

Hl,n = V4>[- \S L =N] . (2.13) 

It is not hard to see that the measures hl.n are independent of <j) on the right-hand side. A question of 
interest is the convergence of the measures pl,n in the thermodynamic limit L, N — > oo, N/L — > p. 
This is answered by the equivalence of ensembles principle, which states that in the limit the measures 
Pl,n locally behave like a product measure v$ for a suitable <f). Note that when p < p c there exists a 
unique <fi = <p(p) such that p = R((f>) , whereas if p > p c no such <j> exists. The equivalence of ensembles 
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precisely states that if / is a cylinder function, i.e. a function that only depends on the configuration r\ 
on a finite number of sites, then 

provided that (see [20] and Appendix 2.1 in [23]) 

R(4>) — p and / e L 2 (y<t,) for p < pc and 

= 0c = 1 and / bounded for p > p c . (2.15) 

The behaviour described above is accompanied by the emergence of a condensate, a site which contains 
0{L) particles. If p < p c one can easily check that the limiting measures v$( p ) have finite exponential 
moments and the size of the maximum component Ml(i]) = max ie A t Vx is typically O(logL). If on 
the other hand p > p c it has been shown in [22] for the power law case that 

L Ml ^ P - Pc . (2.16) 

The notation in (2.16), which we also use in the following, denotes convergence in probability w.r.t the 
conditional laws pl,n, i.e. 



pL,N 



hvl L - (p - p c ) 



> e 



foralle>0. (2.17) 



Equation (2.14) has been generalized in [3] for p > p c to test functions / depending on all sites but the 
maximally occupied one, and equation (2.16) is proved for stretched exponentials of the form (2.9) with 
A > 1/2, as well. An immediate corollary is that the size of the second largest component is typically 
o(L), which implies that the condensate typically covers only a single randomly located site. 

2.2 Main results 

In the following we study the distribution of the excess mass in the system at the critical point to fully 
understand the emergence of the condensate when the density increases from sub- to supercritical values. 
We consider the thermodynamic limit N/ L — »■ p c where the excess mass is on a sub-extensive scale 
\N — p c L\ = o(L). Our first theorem on the power-law case (2.8) relies on a result of Doney (Theorem 
2 in [13]) for the estimation of v^, a \Sl — N] . Precisely, for z := (N — p c L) /y/L — > oo, we get 

v 4>c [S L = N] = e-^(l + o(l)) + L v^ c [ Vx = [zVI] ] (l + o(l)) (2.18) 

as L — > oo. It turns out that when — p c L is close to the typical scale \[L (case a) in the Theorem 
below) the first term of the sum dominates the right hand side of (2. 1 8) and the excess mass is distributed 
homogeneously among the sites. On the other hand, when N — p c L is large enough (case (b)) it is the 
second term that dominates the right hand side of (2.18) and this implies the existence of a condensate 
that carries essentially all the excess mass. Finally, there is an intermediate scale (case (c)) where the 
two terms are of the same order and both scenarios can occur with positive probability. 

Theorem 2.1 (Upside moderate deviations, power law case) 

Let A = 1 and b > 3, so that a 2 < oo. Assume that N > p c L and define -jl E M by 
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o) If lh — 00 the distribution under /j,l,n of the maximum Ml is asymptotically equivalent to its 
distribution under v^ c . Precisely, for all x > we have 



lim p, L . 



M L 



< x 



lim 

L— >oo 



V 4>c 



M L 



< x 



exp 



Aix 1 - 



(2.20) 



In particular, if N — p c L > L'- 1 then 



Ml 



N-p c L 



0. 



(2.21) 



b) If 1l — > +°° normalized fluctuations of the maximum around the excess mass under Pl,n 
converge in distribution to a normal r.v, 



M L -(N- p c L) 



Af(0,l). 



(2.22) 



In particular, 



Mr 



N -p c L 

c) IfjL — > 7 € K we nave convergence in distribution to a Bernoulli random variable, 
Ml 



N- Pc L 



Be{p 1 ) , 



(2.23) 



(2.24) 



where p 1 G (0, 1) is racn that p 1 — > (1), as 7 — > —00 (+00). An explicit expression for p 7 is given in 
(3.76) ana! (3.77) in fne proofs section. 

The next result connects the fluctuations of the maximum to the extreme value statistics expected in the 
subcritical regime. 



Theorem 2.2 (Downside moderate deviations, power law case) 

Let A = 1 and b > 3 ana 1 ae/zne ujl > fry 

TV = p c L - cj L a- 2 L^ r T . 



(2.25) 



a) If uil — >■ fne distribution under Pl,n of the maximum Ml is asymptotically equivalent to its 
distribution under v$ c . Precisely, for all x > we get 



lim pl,n 

L—t-oo 



Mr 



< x 



exp 



fr) TfwL — > w > fnen fnere exisfs a positive constant v such that for all x > 



lim /x^jv 

L— >oo 



< X 



exp < —A\ 



■ dt 



(2.26) 



(2.27) 



c) If ujl — > 00 f«en fnere exisf sequences Bl — > 00 ana 1 sl = pc ^2 L N + °(1)) ~> °°» ^wc" 

that for all x gl 



lim ^l,at 

L— »oo 



Mr - 73, 



1/«L 



< X 



= cxp{-e x }. 



(2.28) 
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We return to a more detailed discussion of these results in Section 2.3 after stating the results for the 
stretched exponential tail (A < 1). For this case the counterpart of estimate (2.18) was obtained by 
A.V. Nagaev in [25], where the size of the maximum is also discussed and which is summarized in the 
appendix. In fact, a careful reading reveals that equation (2.33) below is already contained there. 



Theorem 2.3 (Upside moderate deviations, stretched exponential case) 

i 

Let A e (0, 1) and C\ = (1 + A)(2A) _T + X (rrj) * +A - Assume that N > p c L and define ti, > by 



N = p c L + t L {cr 2 L)^ 



(2.29) 



a) lim sup < c\ the distribution under /j,l,n of the maximum Ml is asymptotically equivalent to 
its distribution under v^ c . Precisely, there exist sequences dl^l such that 



VL 



and for all x £l we have 

~M L - y L 



lim p L .N 

L— yco 



< X 



lim v§ c 

L— 7-oo 



as L — > oo, 



M L - y L 



In particular, if N — p c L (logi) 1 ->- then 



< x 
Ml i*l,n 



0. 



(2.30) 



(2.31) 



(2.32) 



N- Pc L 

b)IftL — > t with C\ <t < +oo, there exists a sequence cll and a function a(t), — > a(t), such that 



M L -(N- p c L)a L ^ 



TV 



0. 



i A ( 1 - a («)) , 

1 w) — / 



(2.33) 



In particular, 



M L 



fJ>L,N 



a(t). 



(2.34) 



N- Pc L 

The sequence cil is implicitly defined by (A.7) in the Appendix (with cil = 1 — aand^y = 6/(1 — A)), and 
when N — p c L 3> we may take cll = 1. The limit a(t) in the preceding equation is an increasing 
function oft with 

2A 



lim a(t) 
tic x 1 + A 



and 



lim a(t) = a(+oo) = 1. 

tfoo 



c)Ifth — > C\, assume that A > 1/2 and suppose 

N = p c L + c x (cr 2 L) tta _ 1+^(^)1^ (3 i gL + lL ) 

with 7l — > 7 G K. TTien we /zave convergence to a Bernoulli random variable, 

Ml vl,n 2A 



1 + A 



(2.35) 



(2.36) 



(2.37) 



where p 1 G (0, 1) is such thatp 1 — »■ (1) /or 7 -4- —00 (+00). An explicit expression for p^ is given in 
(3.21). 



In c) analogous statements also hold for the case A < 1/2, which can be derived from the results in 
[25] summarized in the appendix. However, the order of the sub-leading scale depends on the first few 
Cramer coefficients of the distribution, and results cannot be formulated in an explicit form as above. 
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Theorem 2.4 (Downside moderate deviations, stretched exponential case) 

Let A < 1 and define u>l > by 

N = p c L - cu L L (logi) - !^. 

IfoJL — >• c G [0, +oo] there exist sequences 71, ana! (l, both increasing to 00 with L, such that 

lim p L . N [M L <j l +x(l}= eT^ , (2.38) 

Tjf wl — > the distribution under Pl,n of the maximum Ml is asymptotically equivalent to its distri- 
bution under v^ c . Precisely, if yh and are the sequences introduced in Theorem 2. 3. a), we can take 
7l = Vl and ( L = b L to get 

lim fi L ^ N [M L < y L + x b L ] = e~ e * , 

L— >oo 

Our final result focuses on the fluctuations of the bulk outside the maximum. 

Theorem 2.5 (Fluctuations of the bulk) 

Assume A € (0, 1), or A = 1 and b > 3 so that a 2 < +00. 

a) In the subcritical regime, that is if f -^> , the distribution under pl.n of the bulk fluctuation 
process converges in the Skorokhod space to a standard Brownian bridge conditioned to return to the 
origin at time 1, i.e. 

b) In the supercritical regime, that is if N — p c L > Aj, and N M ^ L —4- n, k a positive constant, the 
distribution under fiL,N of the bulk fluctuation process, converges in the Skorokhod space to a standard 
Brownian bridge plus an independent, random drift term. Precisely, if f] x — r) x \{r\ x < L 1 / 4 } then 

where $ ~Af(o, 1/(1 - A(1 ~° (t)) )^ is an independent random variable. 

When A = 1, or when A e (0,1) and N — p c L 3> we may take cll = 1. Otherwise, cll is defined 
by (A.7) in the Appendix (with ax, = 1 — a and 7 = 6/(1 — A)). 

The supercritical case (assertion b) above) takes a particularly simple form when a(t) = 1: then $ ~ 
Af(Q, 1) is a Gaussian variable independent of the Brownian bridge component, and hence BB S + s $ 
is a standard Brownian motion B s . This is the case for the supercritical power law, or for the stretched 
exponential law when — > +00. 

2.3 Discussion of the main results 

As is already summarized in the introduction, Theorems 2. 1 and 2.3 imply a weak law of large numbers 
for the excess mass fraction in the condensate Ml/(N — p c L). The critical scale Al for the excess 
mass, above which a positive fraction of it concentrates on the maximum and forms a condensate 
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Figure 2: Results of Monte Carlo simulations of the zero-range process with rates (1.1) with A = 0.6 and 6 = 2, 
on a one-dimensional lattice with L — 1024 sites and periodic boundary conditions. For these parameter values 
p c = 0.842, a 2 = 2.55 and c\ — 4.09, and we choose N = 1360 particles, which is very close to the leading 
order prediction of 1356 for the critical scale according to (2.36) with 7 = 0. We plot four realizations for the 
accumulated profile Sk = against k, and see that both, fluid and condensed realizations occur. In the 

condensed case, the mass is shared between the condensate and the bulk according to the prediction (2.35) with 
— 3/4, as is indicated by the dashed lines. 



according to (2.21), (2.23) and (2.32), (2.34), is summarized in (1.2). It is of order \JL log L for the 

power law case given precisely in (2.19), and the lighter tails in the stretched exponential case lead to 

1 

a higher scale of order L *+ A given precisely in (2.29). At the critical scale the excess mass fraction 
can take both values with positive probability (cf. (2.24) and (2.37)), depending on sub-leading orders 
as detailed in (2.19) and (2.36). In the power law case, the condensate always contains the full excess 
mass (2.23) as soon as it exists. On the other hand, for stretched exponential tails the excess mass is 
shared between the condensate and the bulk according to (2.34) as long as N — p c L ~ A/,, and the 
fraction a(t) of the condensate gradually increases to 1. This behaviour is illustrated in Figure 1 in 
the introduction. The results on the bulk fluctuations in Theorem 2.5 imply that below criticality the 
excess mass is distributed homogeneously in the system, and that the same holds above criticality in 
the bulk, which completes the above picture. These results are illustrated in Figure 2, where we show 
sample profiles for a zero-range process which show exactly the predicted behaviour already at a rather 
moderate system size of L = 1024. 

The discontinuous formation of the condensate on the critical scale implies that it forms 'spontaneously' 
out of particles taken from the bulk of the system: When crossing the critical scale by adding more mass 
to the system, the number of particles joining the maximum is indeed of higher order than the number 
of particles that have to be added to the system in order to form the condensate. A similar phenomenon 
has been reported for the Ising model and related liquid/vapour systems in [8, 9]. In contrast to these 
results, the condensed excess mass fraction at criticality is not 'universal', but depends on the system 
parameter A according to (2.35). This might seem surprising at first sight, but the rates of the form (1.1) 
introduce an effective long-range interaction when the zero-range process is mapped to an exclusion 
model with finite local state space (see e.g. [14, 15]). 
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In addition to a law of large numbers our results also include limit theorems for the fluctuations of the 
maximum, which are Gaussian above the critical scale (cf. (2.22) and (2.33)), and given by the extreme 
value statistics below criticality. As long as Iiirl^ 00 (N — p c L)/Al < 1, the excess mass does not 
affect the behaviour of the maximum. According to statements a) of Theorems 2.1 and 2.3, Ml scales 
as the maximum of i.i.d. random variables, which is proportional to L^ 1 and (log L) , respectively, 
with limiting Frechet distribution for power law tails (2.20) (cf. [16]) and Gumbel distribution for 
stretched exponential tails (2.31). Theorems 2.2 and 2.4 describe the crossover to the expected Gumbel 
distributions in the subcritical regime, where the marginals have exponential tails. In the power law 
case, the change from Frechet to Gumbel occurs at the critical scale p c L — N ~ xC 6 - 2 )/^- 1 ) according 
to (2.27). In [16] the behaviour of the maximum was predicted for N = pL with p smaller, equal, or 
bigger than p c for the power law case A = 1. Our results provide a rigorous confirmation including 
the stretched exponential case A e (0, 1), together with a full understanding of the crossover from 
subcritical extremal statistics to Gaussian fluctuations in the supercritical regime. 

We point out that at criticality the correlations introduced by conditioning on the total number of 
particles shift from being entirely absorbed by the bulk to being entirely absorbed by the maximum. 
Indeed, when N — p c L < Aj,we know from Theorems 2. 1 a) and 2.3 a) that the maximum behaves as 
the maximum of i.i.d. random variables with distribution . On the other hand, if N — p c L 3> A^, the 
bulk asymptotically behaves as i.i.d. random variables with distribution v < p c following from Theorems 
la and lb in [3], and the discussion after Theorem 2.5. 

In the stretched exponential case, there is another interesting point regarding the centering of the bulk 
variables in the central limit theorem: When the excess mass exceeds A l the typical excess mass in the 
bulk is 



(1 - a L )(N - p c L) 



(N - Pc L) 



A ' 



as follows from the implicit definition (A.7) of (Ll- This is of order at least \fh unless N — p c L > 
£V( 2A ), hence the special centering required in Theorem 2.5 b). In this case the equivalence of ensem- 
bles cannot be extended to the strong form of [3] (Theorem 2b). Note that for A < 1/2 this affects even 
supercritical densities, i.e. N/L — > p > p c . This is why previous results did not cover this case, which is 
summarized in the following simple Corollary of Theorem 2.3, and completes the condensation picture 
for supercritical densities. 

Corollary 2.6 //A e (0, 1/2] and N/L -> p > p c we have 
-M L — > p- pc, 

and the fluctuations around this limit are given by (2.33), with (1 — at) = 0(L~ X ). 

A necessary condition for our results is the existence of finite second moments, and the case A = 1 and 
2 < b < 3 is not covered by this article. The reason we cannot provide results analogous to Theorems 
2.1 and 2.2 is the lack of a precise estimate for the probability of a moderate deviation of the sum in 
that case, similar to the result (2.18) by Doney [13] for square integrable power-law tails. Nevertheless, 
when the excess mass is such that 

P[S L = N] =LP[»fc = [JV- p c L]] (l + o(l)), 

we can still apply Theorem 1 in [3] to obtain a stable limit theorem for the fluctuations of the maximum 
around N — p c L. For instance if 2 < b < 3, the preceding relation is true provided N — p c L'^s>L T=T , 
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and under this condition we get that 

M L ML.iv , , Ml — (N — p c L) ^ L _ N 

- 1 and ; =>■ Gb-i 



N- Pc L L ^r 
where Gb-i is a completely asymmetric stable law with index 6 — 1. 

3 Proofs 

Since the product measures v$ and the conditional distributions /j,l,n are exchangeable and independent 
of the jump probabilities p(x, y), the spatial structure of zero-range configurations is irrelevant for our 
results. In the following we will therefore consider r/i , 772 , ... to be i.i.d. integer valued random variables 
defined in a probability space (fi, J 7 , P), where T is the er-field generated by {%}jeN and P = v$ c . We 
further define 

Pk :=-P[r h = k\. (3.1) 
Note that p n is directly proportional to the stationary weights w(n) in (2.4). Recall the notation 

L 

Pc = E[th] , a 2 = E[ V f] -p 2 c , Sl-V^, M l = . max m , (3.2) 

i=l 

and that the conditional laws are given by pl.n — P[ • | Sl = iV] . We will denote by a;+ = max{i, 0} 
the positive part of a real number x, and by £~ = (— x) + its negative part. 



3.1 Preliminaries 

Our proofs mainly involve explicit estimates and standard large deviations methods. One such technique 
consists in introducing a change of measure that renders the rare event typical. Precisely, given a > 
and set, define a new measure P Q (s) on the cr-field Tl ■= c(r/i, . . . , i]l) by 



.(*) 



d¥ 



^ Ws) 1{ML< - a}€ ' (3 ' 3) 
where the normalization above is given by 

Note that under F a (s) the random variables {i]k}keN are i.i.d., bounded above by a, their mean value 
is given by 

fe<a 

and their variance is given by G 2 a (s) = p' a (s). It is not hard to verify that 

lim Pais) = inf{fc > : pk > 0} and lim p a (s) = sup{fc < a : pk > 0}. 

Since p a {-) is a continuous increasing function, it follows that if N/L is sufficiently close to the mean 
p c of the distribution and a is sufficiently large, there exists an = s* (L, N, a) such that 

N 

Pa(8.) = —. (3.4) 
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The following lemma can be applied to compute the exact asymptotics of the conditional maximum 
when the average is set to be a small perturbation of the mean, using an a priori estimate as input. 

Lemma 3.1 Take N = N(L) such that ^ — > p c and suppose the following conditions are satisfied: 

i) There exists a sequence < oo such that 

lim Hl,n\M l < a LiN ] = 1 and lim - p c ) a L . N = 0. 

L^oo L^oo \ L J 

ii) For each i£| there exists a sequence Pl(x) < cxl,n with Pl{x) — > +oo, such that 

L e s ' k Pk — > e [0,oo) as L ->• oo , 

where = s*(L) is defined as in (3.4) with a — cxl,n ■ Then 
Pl,n [M L < /3 L (x)] — > e"*^ as L -> oo . 



Proof: We begin by showing that under the conditions of the Lemma s+OiL,N — > 0. For ease of notation 
we may write a and /3 as shorthands for o.l,n and /3l(%) respectively. Using the elementary inequality 

(x — y)(e x — e v ) > (x — y) 2 , valid for all x, y > we have for any s > 



^ ' k<a 

k<a k<a. 

> e sJV/i L - j - E (* - ^)P>) + - 2 - s J2( k - T ) V (3-5) 



If we set 

S (T 



2 = 2£(*-£) Pfc + 2(£-„ c ) + 



it follows from (3.5) that p a (s ) > N/L for sufficiently large L, and since p a (-) is increasing we have 
< s . On the other hand, in view of condition (i) in the statement of the Lemma and the finiteness of 
the second moment we have s a — »■ 0. Thus, 

sf^L.N -> 0. (3.6) 
If s* < we still have 

and since all the terms in the preceding sum are non negative, this implies 

s~ -»• 0. (3.7) 



13 



The limits in (3.6), (3.7), together with the dominated convergence theorem and Fatou's lemma imply 
that 

Z a {s*) = ^2 eS * k Pk — > 1 and z p( s *) = ^2 e ' S * k P k — > 1 as L ^ oo. (3.8) 

k<a k<f) 

This in turn gives after another application of the dominated convergence theorem that 

°l( s *) = 7 1 T (fc- ^-) 2 e Stk p k — ><J 2 and ff§(s») — > a 2 as L -> oo. (3.9) 

k<a 

We proceed now with the proof of the assertion of the lemma. Given x € K, write 

F[M L < L (x), S L = N] = Z%(s*)e- s * N Fp(s*)[S L = N] , 

and 

P[M L <a L , N , S L = AT] =^(s*)e- s * JV P a (s*)[5 i = N] . 
By condition f /) in the statement of the lemma we have 

'[M L <0 L (x), S L = N] 



" L * [ * lL - !l - [ - n - F[M L <a L , N ,S L = N] 



Z a {8.)) P q (s*)[5l = N] ' 
By the local limit theorem for triangular arrays (Theorem 1.2 in [11]) and (3.4) and (3.9), we have 



V2nLa 2 P a (s*) [S L = N] — > 1 . (3.11) 

In order to compute the asymptotics of P^(s*)[5l = N] in (3.10) we need to obtain estimates on 

pp(a.)-N/L. 

Pf>{s*) = 



J2k</3 eS * k Pk J2k</3 eS * k Pk 
-0 



N , E P<k <Af -k)e°- k p k 



L Hk<fi eS * k Pk 

and 



L) \ Z {s*) 

L E f 3 <k < a eS ' k Pk 



< 



E (T~ k ) eS ' kpk - (3A2) 



It now follows easily from (3.8), (3.9) and condition (ii) of the Lemma that 



N^ 2 



lim L p fj (s*) - — =0. 
By another application of the local limit theorem for triangular arrays, we get that 



v / 2irLa 2 P /3 (s»)[S* L = N] — >■ 1 as L — > oo , 
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and using condition (ii) (3.10) becomes 

3.2 The power law case 
3.2.1 Proof of Theorem 2.1 

Here TV > p c L and N/L -> p c as L t oo. 

We will use that if A = 1 then p„ ~ A\nT h , and the decomposition (2.19) 

b log log 1/ 7l \ 



□ 



2(6-3) logi logi/ 
that holds for b > 3. 

The proof of the theorem relies on (2.18) and the following two lemmas. 

Lemma 3.2 Let b > 3 and N > p c L be such that the sequence 7^ in (2.19) has a limit 

lim 7 L = 7 e [-00, 00) . Ifa L = then 



a))- 



Lemma 3.3 Suppose b > 3 one/ iV — p c i 3> $L\f~L,for a sequence t?£ — > 00. 77zen as L — > 00 



M L >N- Pc L - d L VL: S l = N 



= A 1 L(N-p c L)- b (l + o(l)). 



Proof of Lemma 3.2: The argument follows the standard approach used for moderate deviations of the 
sum of i.i.d. random variables. Consider e M such that p aL (s*) = ^. Notice that p QL (0) = p c — 
^ai(O) Sfe>Q L — Pc)Pfc < -j- for sufficiently large L, and by (3.6) we must have = o(log L/y/L). 
In particular, we have Ls^ +e — > for all e > 0. Just as in the proof of Lemma 3.1, we may write 

IP [Ml < oil; Sl = N] = ZL L {s*)e- s * N W aL {s*)[S L = N] 

= Z^ L (0) cx P |-£^ 'tp' aL (t)dt^ F aL (s*)[S L =N}, (3.13) 
where we have used the identity 

We now determine the asymptotic order of each term in (3.13). Observe that 

L 

— > 1 as L — > 00 . 



^(0) = i/ 0e [Mi < a L ] = (l - Pt) 

\ k>a L ) 
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Furthermore, if we define h aL (t) = p aL (t) — p aL (0) — to\ (0) we have 

/ tp' aL (t) dt = f ( PaL (t) - PaL (0) - h aL (t)) P ' aL (t) dt 

Jo a a L y J > Jo 

= ^ -rj^l h aL (t) PaL (t)dt. (3.14) 

Using elementary estimates one can show that 

L( Pa M - PaL (0)) 2 (N- Pc Lf 

2<(0) 2a*L {iA " } 

For the rightmost term in (3.14) notice that for all s e [0, s*] we have 

0< k 2 (e sk -l) Pk <Cs f -, 

k<OtL 

for some e > 0, which implies that \a^ L (s) - a^ L (0)| < Cs e . Therefore, 

L £ K L (t)p' aL (t) dt = L £ Qf * ** aL ( 8 ) a 2 aL (0) ds) ul L (t)dt = 0{Lsl + *) -+ 0, 

where in the first equality we used that p' aL (t) = o 1 aL (t). Together with (3. 14), (3.15) this gives 

(N- Pc Lf 



exp j-L tp' aL (t) eft j ~ oxp j- 



as L — > oo . 



2cr 2 L 

The assertion now follows recalling that y / 2irL<j2 ~ (s* )P Qi (s* ) [Sx = AT] — >■ 1 by (3.11). □ 
Proof of Lemma 3.3: Consider a sequence $l as in the statement of the lemma. Then, 

P L := P[M L > N - Pc L - i9 L VI; S l = N] 

~ £p fc P[S , £-i = JV-A:;M i _ 1 <fc]. 

Using the central limit theorem we can see that the contribution to the sum of the terms outside the set 
U L = {k e Z : \N — p c L -k\< -OlVl} is negligible, that is 

Pl=Y. LPkP[S L -i = N - k; M L ^ < k] + o(L(N - p c L)- b ) . 

k£U L 

We can now use the regular variation of p k to get 

P L =A 1 L(N- Pc L)- b ( V[S L -i = N - k; M L _! < k] + o(l)\ 
^keu L ' 

The last sum converges to 1 again by the central limit theorem and the fact that P M L _ X < k 
uniformly for k e Ul, so 

P L =A 1 L(N- Pc L)- b (l + o(l)) 1 
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as asserted. 



□ 



We proceed now with the proof of Theorem 2.1: 
a) The case — > — oo. 

By Lemma 3.2 and (2.18) if N — p c L 3> \f~L or the local limit theorem otherwise, condition ;) of 
Lemma 3.1 is satisfied by oil = VL/\ogL. Consider > such that p aL (s*) — N/L and let 
/3 L (x) = xL~^, x > 0. Then 

L Yl e S * k Pk=L Y P k + L E (e s * k -l)pk 

f3 L (x)<k<a L f3 L (x)<k<a L f3 L (x)<k<a L 

= LF(xL^)+L Y, (e s * k -l)p k -L Y Ph- 

/3i,(x)<fe<ai k>a L 



It is easy to see that the first term above converges to x 1 b and that the last two terms vanish in the 
limit, since s^a^ — > by (3.6). That is 



L (x)<k<a L 



e s ' k Pk 



A 1 



-1-6 



as L — > oo , 



6-1 

which is condition if) in Lemma 3.1. 

b) The case 7l — > +oo. 

This case is essentially treated in [3]. It is shown there (cf. Theorem lb) that when the second term on 
the right hand side of (2.18) dominates the probability of the event {Sl = N}, the variables {rji} aside 
from their maximum become asymptotically independent with distribution v$ c . This entails that for all 

y e K 

-M L -(N- Pc L) 



which is (2.22). 

c) The case — > 7 € 



< 



/ 2 (fa, 



Here JV — p c L ~ cta/ (6 — 3)LlogL , and the two terms in the right hand side of (2.18) are of the same 
order. Precisely, 



V2irLa 2 



exp{ 



(N- Pc L) 2 
2<x 2 L 



} 



( 7 b - i (fe-3)2 



L^i[iV-p CJ L]- b 
It follows by (2.18) and Lemma 3.2 that 



2ttAi 



e -(6-3)7 = . ^ 



(3.16) 



lim inf jiL.N 

L—>co 



M L < a L 



> 



l + £ 7 

On the other hand, applying Lemma 3.3 with -§l <C ^/\ogL we have that 

1 



lim inf /j,l,n 

L— >oo 



M L >7V - p c L-d L ^L 



1 



17 



and by the central limit theorem 



limsup^L.Ar 



M L >N- Pc L + $ l Vl 



= 0. 



The last three relations together imply that 



-ML-^Be( Pj ) with P7 = T ^-. (3.17) 

Note that p 7 as given above satisfies 

'7 — > 00 



r 1 if- 

\0 if- 

This finishes the proof of Theorem 2.1. □ 



P7 ~* 1 h 't 7 -> -00 



3.2.2 Proof of Theorem 2.2 

Here N < p c L and N/L -> p c as L t 00. 

We take oil = 00, so that (z) in Lemma 3.1 is automatically satisfied. It remains to identify the sequence 
Pl — > 00 and the limit $ in condition (;'/) for each case (a), (b) or (c) in the theorem. Note that since 



— =p{s*)=pc + I <r 2 (s)ds, with cr 2 (s)^>(T 
^ Jo 



we have 

N - p c L , , ^ 

s* = 2^(l + o(l)) asi^oo. (3.18) 

(J -L 

a) The case wi — > 0. 

Let f) L (x) = iL 5 ^ . Then 

L J2 e s * k Pk=LF[f3 L (x)]+L £ (e s ' k - l)p k , 

k>P L (x) k>/3 L (x) 



where 



> l J2 ( e "* fe _ ^ Ls * Z fcpfc = ~^ • 

k>P L (x) k>f3 L (x) 

That is, 

x E e s * k Pk^^- b - 

k>fj L (x) 

b) The case ujl — > w > 0. 

As in the previous case, let /3l(x) = xL^ . By the regular variation of the probabilities p/j, 

fc>/3i(x) fc=/3i,(x) + l fe>ML^ 
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We compute the limits of both terms on the right hand side above: 
lim lim L e s ' k ^ T = 

and by (3.18) 

Mi^ 1 „ ML" 



M1j u ' 1V1 Li" ^ s fell 

lim lim L e s * k —r = lim lim L cxp < — cj 7— > —. 

M^ooL-)oo ^ k b M^ooL^oo ; rim J k' 

k=p L (x)+l fc=/3z,(x)+l 

f M 1 

Lim y e~ ut ^dt. 



M— ¥00 _ 

c) The case — > 00. 

Define now a sequence B L by the equation 

(\a.\B L ) b e^ BL = AiLla,! 6 - 1 , (3.19) 

and note that w L — > 00 implies that |s*|£?l — > 00 as well. Let /3l(x) = £?l + - — r . Then 

I s * I 

L ^ e°* k p k =Le°* B <- ]T e-^V 

k>P L (x) k>P L (x) 

k>0 L (x) 



k>/3 L (x) 



k 



fe'>x/|a»| 

as L — > 00, using dominated convergence with |s*|Sj, — > 00. The third line above follows from the 
second one by (3.19). This concludes the proof of the theorem. □ 

3.3 The stretched exponential case 

b 1 — A 

Here we have p„ ~ A>e . The proofs in this case use results from [25], which are summarized 

in the Appendix. 



3.3.1 Proof of Theorem 2.3 

We recall the the notation from equation (2.29) 

N = Pc L + t L (a 2 L)rh . 
a) The case lim sup ti, < c\. 

L— too 
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The second equation in (2.31) that gives the limit theorem for Ml without conditioning is a standard 
computation in extreme value theory. The appropriate scales 

/ 1 _ \ \ T^A „a 
Vl ^ ( — — log £ J , b L ~ -y as L -)• oo 

are chosen so that 

L Pk^e~ x , iGl. (3.20) 

k>yL+xb L 

Let c > 1. According to (A.4) 

¥[M L < cy L ; S L = N] ~ ¥[S L = N] . 

We thus set the sequence oll = cy^, and item ;') in Lemma 3.1 is satisfied. Recall that > since 
N > Pc L. 

If we define 0l{x) = Vl + xb],, we obtain 

lim L } e s * k pk — lim L } pk 

/3 L (x)<k<a L f3 L (x)<k<a L 

= lim L ^ p k - lim L V" p k 

L->oo L — ' L^roa ' — ' 

k>yL+xb L k>cy L 



where the first identity follows from (3.6) and the third one is (3.20). This provides condition if) in 
Lemma 3.1. 

b) The case — > t > c.\. 

When AT — p c L ;» this can be deduced by Theorem la in [3], since in that case the L — 1 smallest 
variables become asymptotically independent. In fact, we can then take oj, = 1. For smaller values of 
N — p c L, even though it is not stated explicitly, this is essentially proved in [25]. Note that by Remarks 
2,3 and 5 in the Appendix for any 9l — > oo we have 



PL,N 



| M L - (1 - a)(N - p c L)\ < 9 L VI 



1. 



where a is implicitly defined by (A. 7). This implies that the conditional distribution of 

— ^ — — - is tight. A careful reading of the proof of Lemma 7, part 2 in [25] 

reveals that in fact this distribution has the asserted limit. Note that the sequence cll in the state- 
ment of Theorem 2.2 is given by 1 — a. The properties of its limit a(t) can easily be deduced from (A. 7). 

(c) The case ti, — > C\. 

This case can be treated analogously to the third part of Theorem 2.1. By (A.l 1), the leading order of 
P[S L = N] is the sum of two explicit terms, and one has to find the precise subscale around N — p c L — 
c\{(j 2 L) 1 ^ where these two terms are of the same order. Using (A.12) for A > 1/2 and (A.7) we find 
that on the scale (2.36) 

1 — / A X L -4!- 7 (i- a) i 



aV2nL / h ^ 2 7A(i-A)L 

V fc 1 + A (l-Q)l + A 

^ -=:V (3.21) 
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From this, (2.37) can be deduced analogous to the proof of Theorem 2. 1 with p 7 = (1 + £ 7 ) 1 . □ 
3.3.2 Proof of Theorem 2.4 

As for the downside deviations in the power law case, we here set = oo. Then i) in Lemma 3.1 is 
satisfied, < 0, and satisfies (3.18). Now the limit in ii), Lemma 3.1 is given by = e~ x in all 
cases, so we just need to prove that the proposed values of the sequence 0l(x) do the job. The proofs 
are all based on the following computation, with simple adjustments to match each situation. 

Let 7l and (l be sequences such that there exist i\ and <2 el with 

lim-fzT=°> limCz,|s*|=4 and lim = £ 2 , £ 1 +M 2 = 1. (3.22) 

L^co 7£ L— s-oo L^co <y£ 

Then 

Il+vCl Il+vCl . . 

k=jL+xCh k=*y L +x( L 

= A x Le s ^-r^<- X £ e^ k ~^ e~^ {k ~ 1L) (l + {l)) 
= A\L e s ^-^l- X " 7 g" CL e -(l-l+^X^) (1 + o(1) ) 

fe IL=x(l 



~A X L^— g / e-'dt. 

I s *l + ^x J x (e 1 +be 2 ) 

We now let y — > oo and apply (3.22) to get 



A X L £ e^e-^- X ^A x L, r ~ e~* . (3.23) 

fc> 7 i+xa \ s *n L + ° 

We will work on this limit case by case. 

Recall from the proof of Theorem 2.3 the existence of sequences 
Vl - —r— lo g X i h h-^T as L ^ oo 



b 



satisfying 



L Pk^e~ x , xeWL. (3.24) 

k>y L +xb L 

a) The case s*yL — > <^=> cjl — > 0. 

Here 7^ := and := satisfy (3.22) with l\ = and ^ 2 = 1/6. In fact, in this case it is 
straightforward from (3.24) and dominated convergence that 

lim L e s ' k p k = e~ x . 

k>VL+xb L 
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b) The case s*y^ — >■ 0. 

Here we need to be slightly more careful with the choice of jl- Set Cl = &l and let -jl be the solution 
to 

A\L b L e s *"< L -^^ X = 1 (3.25) 
The sequences y L and b^ can be chosen so that 

A\Lb L eT^ y ^ = 1, 



and from (3.25) 



b t~X-\ „.1-A\ „ „ A _. b (Vl VL 



Y^- x hi- X -yi- X ) or s*yl = - 



By the condition s*y£ -> this implies that — ~ 1, and (3.22) holds with £i = and £ 2 = 1 /&. Also 
(3.25) and (3.23) imply 

lim L e s * fe p fe = e~ x . 

k>~f L +xbL 

c) The case s*y£ -4 c < 0, c £ K. 

lL Cl 

The scaling in the sequences and (l is preserved, but the limits lim — and lim — need to be 

L— >oo y L L— ¥00 0£ 

chosen so that t\ and ^ 2 in (3.22) satisfy l\ + b£ 2 = 1 and the right hand side of (3.23) equals e~ x , 

r/ie case — > oo. 

Let now Cl = , — r and set 7 l as the solution to 

|s*| 

^ e ^-T^7^ = 1 . ( 3. 26) 

(It is easy to see that such a solution exists). Now taking logarithms in (3.26) we obtain that, to leading 
order, 



A V7l 1l 



1 A logL) 1 " ~ 7L [i + I_*| a ,| 7 * 



1 — A 



6 ° J \ b 

from where we conclude that necessarily |s* | 7 £ — > oo. Then (3.22) holds with l\ = \ and £ 2 = 0, and 
(3.23) follows from (3.26). □ 

3.4 Proof of Theorem 2.5 

a) The assertion will follow from Theorem 24.2 in [7] provided we check the validity of the following 
three conditions for the exchangeable random variables = V "~^!1 L . 
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1. ^— > 0. This is trivial since the sum of all £jj ' N is equal to zero hl,n-&-S- 



2. max t~' — > 

1 l<x<L 1 

Theorems 2.2, 2.4 when AT < p c L. 

L 



0. This follows from part (a) of Theorems 2.1, 2.3 when AT > p c L, or from 



3. ^ (^k'^) 2 1 ; We prove it in detail for the case when the occupation variables rj x follow a 

x=l 

power law, the stretched exponential case being completely similar. 
Let e > and set — as in Lemma 3.2 . Then 



Rl,n ■— ^L,N 



1 1 L 




lis 


(Vx-j; 


x=\ 




1 L 


( N \ 


x=l 


(Vx j;) 



> e 



> e, M L < a L 



Hl,n[M l > a L ] (3.27) 



By Theorem 2.1 a) and Theorem 2.2 the second term on the right side above tends to as L — > oo. Let 
us now write 



a;=l 



> e, M L < a L 





>€,M L <a L ,^ =1 Vx = N 




E L 


._ 1 Vx = N 





< 



p ai («.) 


liE^r^-f) 2 -- 2 


> e 









(3.28) 



where we recall that given parameters a > and set, the measure P Q (s) is defined by (3.3). From 
(3.27), (3.28) and (3.1 1) we conclude that 



R L , N < \ / 2ir<7 2 LP aL (s„) 



1 L 



A* , 



2 -a 2 



a;=l 



> e 



o(l). 



The result will thus follow if we show that 

L 



1 ( N.2 ^ 2 



as L — > oo 



(3.29) 



and 



1 V- / ATs2 2 



as L — > oo . 



(3.30) 
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Let us start with the former. If £ > then 

X>*-^) 2 >(a 2 +e)L 

x=l 



,(*.) 



< e ^ c(ff2+e)L E s 



= e -C(^ 2 +e)L 



^ OIL 



X(fe-f)% S ,fc 



yZ Ql/ (s*) fe=i 

where W* denotes expectation with respect to the measure P aL (s*). 



e Pfe 



(3.31) 



Now set ( = log g L , so that £a| — > 0, and apply the elementary inequality e x < 1 + xip{h) 
for x G [0, ft], where = 6 ~ 1 . We get 



Z aL («*) 



l + C(fc- T )V(Ca!) 



= l + C<(s*MCa!). 
From (3.9) we can make a c \ L (s*)i/>(C a l) < c 2 + e/2 for large enough L, so (3.31) becomes 



X s *) 



.x=l 



<e-«^(l + C^ £ («.MCc4))) 



from where (3.29) is easily obtained. The limit (3.30) can be derived by similar estimates. 



(3.32) 



b) In the power law case (A = 1), or when A < 1 and N — p c L >• , the assertion follows 
immediately (with ol = 1) from the asymptotic independence of the bulk variables proved in Theorems 
lb, la in [3], respectively. 

Let us then consider the stretched exponential case A < 1 when N — p c L = fL(cr 2 L) T + x , and 
ti, — > t € (c\, +oo] (we refer to the statement of Theorem 2.3 for notation). The case TV — p c L 3> 
discussed in the previous paragraph clearly belongs to this family as well. 



We first observe that in this situation 

N - a(t)(N - p c L) < p c L + c x {<j 2 L)Th. 
Indeed, according to (A. 7), a(t) satisfies 



(3.33) 



1 



(l-a(t))a(ty 



t i+\ 



with 7 



1 - A 



7 (1 -A) 

Let Xt = t(l — a(t)). Then it follows from Theorem 2 in the Appendix that x t is the smallest positive 
root of the equation 

b = x t (t-x t )\ (3.34) 
and it is easily checked that 

lim Xt = and lim x t = c\ \ ^ < c\ . 

tfoo t{cx 1 + A 
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In order to conclude (3.33) it will therefore be enough to show that x t is decreasing. Differentiating in 
(3.34) we get, after a couple of operations, that the derivative x' t satisfies 



A 



t - x t 



1 

x t 



A 



t - x t 



Now 



A 1_ 

t-x t x t 



Xx t <t — x t 



Xt(l - a(t)) < ta(t) 



1 + A 



< a(t) . 



But a{t) is increasing on the half-line (c\, oo) with lim t ^ CA a(t) 
a(t) > T A X . 



2A 
1+A 



, lirrit-foo a(t) = 1, and hence 



Inequality (3.33) allows us to decompose the random walk Yl into two components: the first 
term will be easily shown to converge to a Brownian bridge via the same arguments applied to prove 
the first statement in the theorem, while the second one is a drift term determined by the Gaussian limit 
specified in Theorem 2.3 b). Precisely, write 

sL] 



Y L = W L + " aL[N ~ PcL) 



Next, consider the interval Al 



wt 



X 



N — Mr 



(3.35) 



associated to 5l = L 



a L (N- Pc L) 

chosen so that Theorem 2.3 b) implies lim^^co hl,n[Ml <£ Al] = 1- Notice that by (3.33) the 
occupation variables {fj x } x= \^... ^ are in the subcritical regime when Ml e Al, they are clearly 
exchangeable, and moreover when properly centered they satisfy conditions 1), 2) and 3) of Theorem 
24.2 in [7]. 

Namely, let £%> N = ,1x ~ (n ^ Il)/l . Then, provided M L G A L , we can easily show that 



1'- Ef 



0. In fact, the sum equals except in the rare event that the second order 



x=l 



statistic of the sample {r] x }i< x <Lis greater than L 1 / 4 . 

2'. | max ^- N \ ^ 0. This is trivial, as fj x < i 1/4 and N - M L < L^h when M L € A L 



l<x<L 
L 



3'- Ed 



L,N\2 A'-L.n 



1. This can be shown following the arguments applied to prove condition 



x=l 



3) in the first statement of the theorem, the difference being that it is now necessary to condition on Ml 
before applying Chebyshev's inequality: for e > 0, 

L 



fJ-L,N 



x=l 



Vx 



N-M L ,2 



IzEfe 



> e, M L € A L 

N-M LS 2 



x=l 



> e 



Mr 



(3.36) 



The estimates leading to the bound (3.32) hold uniformly for N = N — Ml when Ml G Al, and hence 
can be applied to the conditioned expectation in the right side of (3.36) to conclude that 



tE(^- 



x=l 



> e, M L G A l 



0. 
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as required. Conditions 1'), 2') and 3') imply that Wg BB S , the standard Brownian bridge on 

[0,1] conditioned to return to the origin at time 1. On the other hand, we know form Theorem 2.3 b) 
that — — a i<(N-p c L) m^n ^ a zero mean Q auss j an variable with variance 1/(1 — A ( 1 ~"W) ). 

ya z L 1 v a \ l ) 

Our assertion will follow once we prove the convergence of the finite dimensional marginals 
plus tightness for the laws of the sequence Y S L . The former is easily derived by first conditioning on 
Ml; the fact that the limit of the first term in (3.35) is independent of the value of the second implies 
that the finite dimensional marginal distributions converge to those of BB S + s <E>. Tightness is also 
straightforward: by the linearity of the second term in (3.35), it suffices to show that the modulus of 
continuity of the first term tends to 0, 

to(S) = sup |W a L - W r L | ^4 as ^0, 

\s-r\<5 

which is a direct consequence of the Arzela-Ascoli Theorem. □ 
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Appendix 

Theorem 2.2 for the stretched exponential case relies heavily on the asymptotics for the probability of 
moderate deviations in [25]. An English translation of this article can be found in Selected Translations 
in Mathematics, Statistics and Probability (1973), Volume 11. The purpose of this appendix is to 
provide the main results of this difficult to access article, as they apply in our model. 

We use the same notation as in Section 3, introduced in (3.2), and write the stretched exponential tail of 
the law P of the independent integer random variables r]j, j — 1, 2, . . . , L as 

P[f]j = n] ~ Ae~ inl A as n -> oo , 

where A, 7 are positive constants and < A < 1. We are interested in the asymptotic behaviour of the 
probabilities Pl(N) = P[Sl = N] as L — > 00, where N = p c L + k, and k deviates from the typical 
behaviour k = 0(\/L). This is done in [25] when A = 7 = 1 in a series of five theorems and remarks, 
where the asymptotics are obtained for five different ranges of fc-values. In the following, we transcribe 
these theorems for arbitrary values of A and 7, which are applied in the proofs of Theorems 2.3 and 2.4 
with A = A\ and 7 = 6/(1 - A). 

We will denote by Ol a sequence that increases to infinity arbitrarily slowly. Let r^(m) be a sequence 
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such that 

oo 

L n m e-^ X — -> as L -> oo . (A.l) 

n-rL(fn) 

Note that we may take tl (m) = (/3 log L) 1_A , for any /3 > 7 _1 . We will use A^ (z) to denote the first 
t terms of the Cramer series (see e.g. [19], Chapter 8), where 

\] - 1 and A [t] (z) = A + X lZ + ■ ■ ■ + Xt-iz^ 1 . (A.2) 

In particular, A^ (z) = when A > 1/2. The coefficients Xj depend on the cumulants of the distribution. 
Finally, we define 

c\ = (1 + A)(2A)~tta7TTx ^ 
Case 1° 

6VZ< k < (c A -<5)((t 2 L)tta ) (A.3) 
where 5 > is any sufficiently small fixed number. 
Theorem 1 Ifk is as in (A.3), then 

P L (N) = -J= exp { - ^ + £ AM (|) } (1 + o(l)) asL^^. 



1 , (A.4) 



k ^ m+1 



Remark 1 Under the conditions of Theorem 1 

P[5l = N] 

F[S L =N; Vj < r L (m), 1 < j < L] ~ 

where m can be taken as the smallest positive integer such that N J = 
Case 2° 

T 1/2A 

(c x + S)(a 2 L)Th <k<— — . (A.5) 



Theorem 2 Tjffc is as in (A.5), then 

AL f a 2 k 2 . a 3 fc 3 , m /a/c 

K 

l + A 

as L — > oo, where a is the smallest positive root of the equation 
<j 2 L a(l - a) x , ak 



/c!+ A 7(1 -A) v AV L 



(l-Rx(-r ))• (A.7) 
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The term R\ in the preceding equation is given by 

3=0 

and in particular R\ = if X > 1/2. 

Note that the fraction of excess mass on the condensate in Theorem 2.3 is = 1—a with 7 = b/(l— A). 

j 1 

Relation (A. 6) takes a particularly simple form for A > 1/2 and OlL 1 ^ < k < In this case 

P L (N) =AL CX p{-^-- 7 (1 - a) 1 -^ 1 -*}^ + o(l)). 
Remark 2 Under the conditions of Theorem 2 

L_r J j. x ^ 8) 

LP[S L = iV; Vj < r L (m), 1 < j < L - 1; \ VL - (1 - a)fc| < 9 l Vl] 

where m can be taken as the smallest positive integer such that Lk^^ m+1 ^ = o(l) as L — > 00. 
Case 3° 

k>L^6 L . (A.9) 

Theorem 3 Tjf fc is as in (A.9), fnen 

P L (iV) = ALexp{ - 7 fc 1 - A } (l + o(l)) as 00. 



Remark 3 Under the conditions of Theorem 3 

V\S L = N] 



LV[S L = N; Vj < r L {2), 1 < j < L - 1; \r) L - k\ < 6 L VL\ 
Case 4° 



1 as L — > 00. 



(c A - <5)(<7 2 L)t+a < fc < ( CA + (5)(cr 2 L)T+^. (A. 10) 

Theorem 4 Tffc is as in (A. 10), then 

AL ( a 2 k 2 a 3 k 3 , t] /ak 



T „ (?) V-}( 1+ „ (1) ) 

V fel+A(l_ a )l + A 

+ ^E CX "{-2^ + ^ , ' , (l)}( 1+ '" 1 » " S £ — » 00 , (A.H) 



where, as before, a is the smallest positive root of equation (A.7). 
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Relation (A. 11) takes a simple form for A > 1/2. In this case 

AL r a 2 k 2 



V fei+A(l_ a )i+A 

Remark 4 77tis case is intermediate between cases 1 and 2. Under the conditions of Theorem 4 we can 
only state that 

P[S L = N] 

LF[S L =N; n, < r L (m), j < L-l; \r] L -(l-a)k\ < L VZ\ + V[S L =N; Vj < r L (m),Vj] ~~ ^ ' 

vv/zere m = t + 2. 
Case 5° 

-^-<k<L^9 L . (A.13) 

PL 

Theorem 5 /ffc is as in (A.9), then 



Pl(N) = AL cxp { - 7 fc 1 - A + L(1 2fc2 A A } g }(1 + o(l)) as L 



00 . 



Remark 5 In this case the picture is the same as in Remark 3. 
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